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SELECTIVELY DEGLYCOS YLATED HUMAN 
IMMUNODEFICIENCY VIRUS TYPE 1 ENVELOPE VACCINES 

Background of the Invention 
5 The field of the invention is human 

immunodeficiency virus vaccines and iiamunotherapeutics . 

The invention was supported by the U.S. Government 
which has certain rights in the invention. 

Human immunodeficiency virus is the etiological 
10 agent of acquired immune deficiency syndrome (AIDS) . The 
env gene of HIV encodes a 160 kD glycoprotein that is 
subsequently cleaved into two smaller species, an 
extracellular (or surface) protein gpl20 and a 
transmembrane protein gp41 (Allan et al., 1985, Science 
15 228:1091; Di 'Marzo-Veronese et al., 1985, Science 

229:1402). Gpl20 is noncovalently linked to gp41 (Allan 
et al., 1985, Science 228:1091; Chou et al. , 1988, J. 
Infect. Dis. 157:805; Di 'Marzo-Veronese et al., 1985, 
Science 229:1402; Lasky et al., 1987, Cell 50:975). 

2 0 Among the various HIV isolates, some sequences are 

highly conserved and some are variable. Two 
characteristics of the env glycoprotein are conservation 
of cysteine residues and of a relatively large number of 
N-linked carbohydrate sites in HIV-l isolates. Similar 
25 secondary and tertiary structures for the env 

glycoprotein have been suggested based on the similarity 
of the sequences of HIV. 

The env glycoprotein is heavily glycosylated. The 
unmodified polypeptide backbone of gpl20 (about 480 amino 

3 0 acids) weighs about 55 kD. About one half of the 

molecular weight of gpl20 can be accounted for by 
attached carbohydrates (Allan et al. , 1985, Science 
228:1091; Geyer et al., 1988, J. Biol. Chem. 263:11760; 
Matthews et al., 1987, Proc. Natl. Acad. Sci. USA 
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84:5424; Mizuochi et al. , 1988, Biochem J. 254:599; Robey 
et al. f 1985, Science 228:593). Although gp41 is also a 
glycoprotein, it is not as heavily glycosylated as gpl20 
(Di'Marzo-Veronese et al. , 1985, Science 229:1402). The 
5 oligosaccharides of the gpl20/41 complex are generally N— * 
linked with no detectable O-linked sugar residues present 
(Kozarsky et al., 1989, J. AIDS 2:163; Leonard et al. , 
1990, J. Biol. Chem. 265:10373). The consensus sequence 
of the site for N-linked carbohydrate attachment is Asn- 

10 X-Ser/Thr, where X is any amino acid except Pro and Asp. 
HIV-1 molecular clones contain an average of 23-24 
potential N-linked carbohydrate attachment sites on gpl20 
and about 4-7 on gp41. The consensus sites on gpl20 are 
generally glycosylated when the env protein is expressed 

15 in Chinese hamster ovary (CHO) cells (Leonard et al. , 
1990, J. Biol. Chem. 265:10373). 

CD4 is the host cell receptor for HIV (Dalgleish 
et al., 1984, Nature 312:763; Klatzmann et al., 1984, 
Nature 312:767; McDougal et al. r 1986, Science 231:382). 

20 The CD4 -binding domain of HIV has been mapped to the C- 
terminal region of gpl20 (Kowalski et al. , 1987, Science 
231:1351; Lasky et al. , 1987, Cell 50:975), although it 
is reported that sequences in the N-terminal region of 
gpl20 may also be involved (Syu et al., 1990, Proc. Natl. 

25 Acad. Sci. USA 87:3695). 

Vaccines and immunotherapeutics comprising native 
gpl20 and gpl60 have been proposed. 

Summary of the Invention 
We have discovered that selectively deglycosylated 

3 0 HIV-l envelope proteins retain their ability to support 
viral infectivity, implying that they generally retain 
the native envelope conformation. We also noted that the 
envelope protein of the related simian virus for African ^ 
green monkeys (SIV AGM ) , which is not pathogenic to its 

35 natural host, has fewer N-linked, glycosylation sites, 
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particularly in the C-terminal portion of the surface 
envelope protein analogous to gpl20. Without wishing to 
bind ourselves to a specific detailed molecular 
explanation, we propose that a selectively deglycosylated 
5 HIV-1 envelope protein is more effective in eliciting a 
protective immune response in people, Glycosylation 
serves to reduce or prevent immunological recognition of 
envelope protein domains. Selective deglycosylation 
enables an immune response to these domains and improves 
10 the opportunity for a protective immune response, 

Deglycosylation which produces substantial conformational 
changes (as determined by loss of infectivity) should be 
avoided. 

We have further found that the invention can be * 

15 achieved by generating recombinant HIV-1 envelope 

glycoproteins which have primary amino acid sequence 
mutation(s) in consensus sequence (s) for N-linked 
glycosylation (sugar attachment) , so as to prevent 
glycosylation at that site(s) . Moreover, we have found 

2 0 that the position of such genetic deglycosylation is 
important. Preferably, the position of such genetic 
deglycosylation should be between the C terminus of gpl20 
and the Cys at the N-terminal side of the cysteine loop 
containing the hypervariable region 3 (V3) (this Cys is 

25 generally positioned about at residue 296, counting from 
the N-terminus of gpl20) . We have found that it is 
important to remove at least a minimum amount of the 
total native gpl2 0 carbohydrate in order to maximize the 
opportunity for a useful immune response. Specifically, 

30 the mutant glycoprotein should be deglycosylated such 
that the total molecular mass of the mutant gpl20 
component is less than 90% (more preferably 75%) of the 
corresponding fully glycosylated wild type gpl2 0 
component . 
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Another indicia of a suitable conformation for a 
desirable immune response is infectivity — i.e. , the 
mutant glycoprotein (when present as a component of a 
complete HIV-1 virion) enables viral infectivity. By 
5 retaining viral infectivity , we mean that when the 
envelope gene of HIV or an infectious DNA clone is 
engineered to encode the mutations of the* mutant envelope 
glycoprotein , the virus retains infectivity. 

By wild-type or native HIV-1 envelope glycoprotein 

10 we mean the envelope glycoprotein encoded by a naturally 
occurring HIV-1 isolate. With respect to designation of 
amino acid positions of the envelope glycoprotein such as 
the Cys at the N-terminal side of the cyteine loop 
containing V3 (approximately amino acid position 296) , it 

15 will be understood that certain aspects of envelope 

structure are conserved throughout virtually all HIV-1 
strains , and those conserved structures can be used as 
landmarks. For example cysteine cross-links form loops 
which contain hypervariable regions having widely 

20 accepted designations. 

By the term "recombinant glycoprotein" we mean a 
glycoprotein produced by expression of a DNA sequence 
that does not occur in nature and which results from 
human manipulations of DNA bases. The term envelope 

25 glycoprotein means gpl60, gpl20, or other env-encoded 
peptides containing at least the above-described C- 
terminal portion of gpl20. 

Accordingly, one aspect of the invention features 
compositions comprising mutant selectively deglycosylated 

3 0 HIV-1 recombinant envelope glycoproteins as described 
above. Other aspects of the invention feature vaccines 
(both for protecting uninfected individuals and for 
treating infected individuals) that comprise such mutant 
HIV-1 recombinant envelope proteins. Still other aspects 

3 5 of the invention feature DNA encoding the mutant HIV-1 
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recombinant envelope proteins (particularly in an 
expression vector) , recombinant cells comprising such 
DNA, and methods of making the recombinant mutant 
envelope glycoproteins by expressing such DNA. Still 
5 another aspect of the invention features antibodies 

raised, or preferentially binding to, the mutant envelope 
glycoprotein ♦ 

In preferred embodiments, mutants of either gp!20 
or gpl60 can be used. Because the deglycosylation 

10 unmasks envelope regions which are generally conserved, 
it is possible to use any of a wide range of HIV-1 
strains or isolates e.g., MN, HXB2, LAI, NL43, MFA, BRVA, 
SC, JH3, ALAI , BALI, JRCSF, OYI, SF2 , NY5CG, SF162, JFL, 
CDC4, SF33, AN, ADA, WMJ2 , RF, ELI, 22Z6, NDK, JY1, MAL,- : 

15 U455, Z321. The preferred mutation at the consensus N- - 
linked glycosylation sequence is substitution of Asn, Ser 
or Thr with a different amino acid (i.e., any amino acid- 
other than the one occupying the position in the wild 
type) . Preferably, there are multiple deglycosylations 

20 in the above described C-terminal region, particularly in 
the region between the C terminus of gpl20 and the Cys on 
the N-terminal side of the cysteine loop containing 
hypervariable region 4 (V4) . For example, one or more of 
the positions 386, 392, 397, 406 or 463 may be 

25 deglycosylated. We have found that in some cases the 
consensus sequence closest to position 448 and/or 
position 392 may be mutated, together with other C- 
terminal consensus sequence mutations. We have also 
found that it is preferable to maintain glycosylation at 

30 the consensus sequence closest to position 289. It may 
also be desirable in some constructions to maintain 
glycosylation at position 356. For convenience the 
numbers given above gpl20 refer to amino acid residues of 
the HXB2 envelope protein. Those skilled in the field 

3 5 will understand that conservation of envelope features in 
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other strains will permit the application of the 
invention to the envelope proteins of those strains. For 
example, there is conservation of cysteine cross-links 
that define loops with hypervariable regions. Thus, the 
5 reference to positions 386, 392, 397, 406 and 463 can be 
understood as a reference to the N- linked glycosylation 
sites positioned between the C-terminus of gpl20 and the 
Cys on the N-terminal side of the cysteine loop 
containing hypervariable region 4 (V4) . Similarly, the 

10 reference to positions 289 and 356 can be applied to 
other strains with reference to Fig. 1 and Fig. 2. 

Other features and advantages of the invention 
will be apparent from the following description of 
preferred embodiments and from the claims. 

15 Detailed Description 

The drawings are first briefly described. 
Drawings 

Figure l is a diagram depicting the conservation 
of N-linked glycosylation sites in gpl20 of selected HIV- 

20 1 isolates. Twenty-four consensus N-linked glycosylation 
sites of HXB2 are shown by lines. The numbers above each 
line indicate the amino acid positions in HXB2 . The 
longer lines with an asterisk symbol represent N-linked 
glycosylation sites not present in HXB2 . 

25 Figure 2 is a schematic drawing of gpl20. 

Darkened lines represent the hypervariable regions of the 
molecule which form 5 loops, designated Vl-5, via 
cysteine-cysteine disulfide bonds which are represented 
by the solid lines connecting each end of a loop. The 

30 numbers represent the first amino acid in each of the 24 
potential N-linked glycosylation sites in the molecule. 

Figure 3 is a schematic diagram of gpl20 from HIV- 
1. The distribution and amount of conservation of N- 
linked glycosylation sites is shown. Amino acids are 

3 5 numbered from the N- terminus of the molecule to the C- 
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terminus. The numbers beneath the diagram denote the 
position of the first amino acid in the consensus 
sequence of an N-linked glycosylation site. Sites which 
are > 90% conserved among HIV-1, HIV-2 and SIV isolates 
5 are indicated by an arrow with a solid head and are 

numbered sequentially with the prefix 'a'. Sites which 
are at least 50% conserved are indicated by an arrow with 
an open head and are numbered sequentially with the 
prefix 'b'. Other sites which are conserved at a level 
10 of less than 50% are indicated by an arrow with a wavy 
tail. 

Figure 4 is a western blot demonstrating 
expression of gpl60 and gpl20 in COS-l cells transfected 
with wild type or mutant proviral DNA. Cell lysates from 
15 transfected COS-l cells were separated on 12% SDS- 
polyacrylamide gels, transferred to nitrocellulose 
filters, and then reacted with a reference sheep anti- 
gpl20 serum. The wild type virus is abbreviated WT and 
N-linked glycosylation mutants are indicated by numbers 
20 representing their position in HXB2 . 

Figure 5 is a graphical demonstration of infection 
of CD4-positive SupTl cells by N-linked glycosylation 
mutants. Reverse transcriptase activity in cultured 
supernatants of SupTl cells infected by wild type (WT) 
25 virus and by mutant viruses 141 or 197, was measured over 
a period . of 28 days. The growth kinetics of mutants 88, 
160 and 276 were similar to those of mutant 141. The 
growth kinetics of mutant 262 was similar to those of 
mutant 197. The growth kinetics of other first-site N- 
30 linked glycosylation mutants were similar to those of 
wild type virus. 

Figure 6 is a western blot analysis of the 
envelope glycoproteins expressed by wild type and mutant 
viruses. COS-7 cell lysates were prepared 48 hours post- 
35 transfection and electrophoresed on 12% SDS-PAGE, 
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transferred to nitrocellulose, and reacted with sheep 
anti-gpl20 antisera. (A) Mock, wild type and C2 , C3, C4, 
C5 and C6 mutants. 

(B) Mock, wild type, N2, N3 , N4 and N5 mutants. 
5 Figure 7 is a graphical demonstration of RT 

activity in SupTl cells infected with wild type and 
mutant viruses. (A) Mock, wild type and C2, C3 , C4, C5 
and C6 mutants. (B) Mock, wild type and N2, N3, N4 and 
N5 mutants. 

10 Generation of Molecules Useful as Vaccine Candidates for 
HIV-1 

As outlined above, proteins according to the 
invention are recombinant human immunodeficiency virus 
envelope glycoproteins which are mutated with respect to 

15 a wild type (native) human immunodeficiency virus 

glycoprotein in the primary amino acid sequence to effect 
partial deglycosylation. The genetic change should be 
introduced to positions in the C-terminal portion of 
gpl20 (between the C-terminus of gpl20 and a specific 

20 cysteine which forms the loop containing V3) . 

Notwithstanding the mutation (s) , the conformation of the 
glycoprotein remains sufficiently intact to maintain 
infectivity when present as a component of the virion. 
We propose that, in individuals that are immunized with 

25 this molecule, an immune response will be induced to 
reduce or block viral infectivity. 

As illustrated by the studies described below, 
potential N-linked glycosylation sites in gpl20 can be 
systematically mutated, either singly or in combination 

3 0 by site directed mutagenesis such that the consensus 

glycosylation sequence is disrupted. Recombinant viruses 
are generated containing gpl20 genes that have such 
mutations. To determine whether the conformation is 
retained in the mutated gp!20, the infectivity of each 
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mutant virus is measured. Processing of gpl60 to gpl20 
and gp41 may also be assessed as a rough measure of 
retention of conformation and infectivity. 

In general there are more than 20 consensus N- 
5 linked glycosylation sites in the gpl20 coding sequence 
of HIV-1 isolates. For illustrative purposes, we have 
shown the positions of these sites on gpl20 in HXB2 and 
in other strains of HIV-1 in Fig. 1. The relative 
positions of these sites on the predicted structure of 
10 gpl20 in HXB2 are shown in Fig, 2. A linear map of the 
conserved N-linked glycosylation sites, their relative 
positions and their level of conservation are presented 
in Fig. 3. 

In Figure 3 , the following residue designations 
15 correspond to the arrows of gpl20: 
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230 not 
marked 

25 b4 = 234 b8 = 339 

Sequence information for envelope proteins of other 
strains (e.g. the strains listed above) are referenced in 
Myers et al. Human Retroviruses and AIDS (1991) : "A 
compilation and analysis for nucleic acid and amino acid 
30 sequences" (Los Alamos National Laboratory, Los Alamos, 
NM) , which is hereby incorporated by reference. 

The following studies are provided to illustrate 
(not to limit) the invention, and particularly to 
illustrate methods for readily determining the relative 
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importance of each of the various HIV envelope N-linked 
glycosylation sites and the effect of mutations to those 
sites and combinations thereof. 

Mutation of Potential N-Linked Glycosylation Sites and 
5 the Effect of these Mutations on Envelope Glycoprotein 
Viral Infectivity 

The molecular clone HXB2, which contains 24 N- 
linked glycosylation sites was used as the template DNA 
for site-directed mutagenesis as follows. 

10 Construction of mutants 

Oligonucleotide-directed mutagenesis was performed 
on a 2.7 Kb Sall-BamHI fragment of HXB2 (Cohen et al., 
1990, J. AIDS 13:11), which covers all 24 N-linked 
glycosylation sites of gpl20, using the method of Kunkel 

15 (Cohen et al. , 1988, Nature 334:532.) . The 

oligonucleotide primers used for mutagenesis were 
synthesized using standard cyanoethyl phosphoamidite 
chemistry and are listed in Table I. Mutants were 
identified by the Sanger chain-termination method 

20 (Cullen, 1986, Cell 46:973). The Sall-BamHI fragment 
containing the desired mutation was excised from the 
replicative form of each mutant and used to replace the 
2.7 Kb Sall-BamHI fragment of HXB2 . All HXB2 -derived N- 
linked glycosylation site mutants containing the 

25 designated changes were further verified by DNA 
seguencing (Cullen, 1986, Cell 46:973). 

Western blot analysis of envelope proteins 

Ten micrograms of wild type or mutant DNA was 
transfected into 3-5 x 10 6 COS-1 cells using DEAE-dextran 
30 (Curran et al. , 1988, Science 239:610). Cells lysates 
were collected 48 hours after transf ection. Mock- 
trans fected, wild type, and mutant transfected COS-1 
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cells were washed with phosphate-buffered saline (PBS) 
once and subjected to centrif ugation at 2500 rpms. Cell 
pellets were resuspended with 100 ml RIPA lysis buffer 
(0.15 M NaCl/0.05 M Tris HC1 pH 7.2, 1% Triton X-100, 1% 
5 Sodium deoxycholate, 0.1% SDS) and spun down at 35,000 
rpm (Ti70 rotor; Beckman) at 4°C for 40 minutes. Ten 
microliters of cell lysates were electrophoresed in 12% 
SDS-polyacrylamide gels. A reference HIV-1 positive 
serum at a 1:200 dilution and a sheep anti-gpl20 (AIDS 
10 Research Reference Reagent Program #288) at 1:2000 
dilution were used for western blots as described 
(Dalgleish et al., 1984, Nature 312:763). 

Monitoring of syncvtium-f ormation and viral infectivitv 
The CD4 positive human T lymphoid cell line, 

15 SupTl, was grown and maintained at 37 °C in RPMI-1640 

containing 10% heat-inactivated fetal bovine serum and 1% 
penicillin-streptomycin. COS-1 cells were propagated in 
Dulbecco's minimal eagle medium supplemented with 10% - 
heat-inactivated fetal bovine serum and 1% penicillin- : 

20 streptomycin. Cell-free supernatants were collected 48 
hours after transf ection. Supernatants were filtered 
through 0.45 mm filters and assayed for virion-associated 
reverse transcriptase (RT) activity. Equal amounts of 
wild type and mutant virus, as measured by RT activity 

25 (100K cpm) , was used to infect 1 x 10 6 SupTl cells. One 
milliliter of the culture medium was collected every 
three or four days and assayed for RT. Cultures were 
monitored for 28 days. 

Reverse transcriptase assay 
30 One milliliter of culture medium was mixed with 

0.5 ml 3 0% PEG and 0.4M NaCl on ice for 2 hours and spun 
at 2500 rpm at 4°C for 30 minutes. The pellet was 
resuspended in 100 ml of RT buffer (0.5% Triton X-100, 15 
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roM Tris pH 7.4, 3 mM dithiothreitol, 500 mM KCL, 30% 
glycerol) . Ten microliters of the solution was incubated 
with 90 ml of RT cocktail (40 mM Tris HCL, pH 7.8, 10 mM 
MgCl 2 , 8mM dithiothreitol, 94 ml ddH 2 0, 0.4 U Poly (rA) 
5 oligo (dT) [optical density at 260 nm] per ml and 2.5 
mCi/ml 3 H- labeled dTTP) at 37°C for 1.5 hours. The 
reaction mixture was precipitated with 3 ml of 10% 
trichloroacetic acid (TCA) and 10 ml of 1% tRNA which 
served as the carrier, and was then chilled on ice for 20 
10 minutes. The reaction mixture was filtered through 

Whatman GF/C glass microfiber filters and washed 3 times 
with 5% TCA to remove unincorporated 3 H-dTTP. 
Radioactivity was measured in a liquid scintillation 
counter. 

15 Single Mutants in gpl20 

The ability of HXB2 -derived .mutants (each having 
one of the 24 N-linked glycosylation sites mutated by 
site-directed mutagenesis) to infect CD4-positive SupTl 
cells was compared with that of the wild type virus and 

20 the results are described below. Most of the individual 
consensus N-linked glycosylation sites are dispensable 
for viral infectivity. N-linked glycosylation sites that 
are likely to play important roles in HIV-l infectivity 
are not randomly distributed in gpl20; they are generally 

25 located in the N-terminal half of gpl20. 

Since deglycosylation of proteins can improve 
their immunogenic ity, a candidate vaccine for HIV-l might 
be a partially glycosylated gpl20 with most of the 
dispensable N-linked glycosylation sites removed, such 

3 0 that the conformation of the protein is largely unaltered 
and the CD4 binding site is retained. 

Each of the 24 potential N-linked glycosylation 
sites in the gpl2 0 coding region of the infectious 
molecular clone HXB2, was. individually modified to 
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generate 24 N-linked glycosylation site mutants (Table 
1) . In these mutants, the Asn-X-Ser/Thr attachment 
sequence was replaced by either Gln-X-Ser/Thr or His-X- 
Ser/Thr. The underlying hypothesis was that if a given 
5 N-linked glycosylation site played no significant role in 
syncytium-f ormation or viral infectivity, then such a 
mutant should retain its infectivity and its ability to 
form syncytia. Each of the 24 mutants was designated by 
the residue number of the respective N-linked 
10 glycosylation site (Table 1) . 

Expression of envelope proteins 

To determine if mutations introduced to any of the 
24 N-linked glycosylation sites affected the expression 
of the envelope protein, 10 /xg each of mutant or wild 

15 type proviral DNA was transfected into 3-5 x 10 6 COS-1 
cells using DEAE-dextran as described above. Cell 
lysates derived from COS-1 transf ectants were then 
examined in western blots as described above. As shown 
in Fig. 4, Both gpl60 and gpl20 were detected in all 24> 

20 mutants, suggesting that no particular individual N- 
linked glycosylation site was indispensable for the 
expression of the envelope protein. 

Syncytium-f ormation and viral infectivity 

To evaluate whether mutations introduced into any 

2 5 of the individual N-linked glycosylation sites affected 

syncytium-f ormation and viral infectivity, cell-free 
virions obtained from the culture supernatant of COS-1 
transf ectants were collected at 48 hours post- 
transfection. Equal amounts of mutant and wild type 

3 0 viruses, as measured by RT activity, were used to infect 

CD4-positive SupTl cells. Virus-infected cultures were 
monitored for syncytium formation and RT activity. As in 
the case of the wild type virus-infected SupTl cultures, 
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syncytia and RT activity were detected in all the mutant 
virus- infected SupTl cultures (Table 1) . However, 6 
mutant viruses, mutants 88, 141, 160, 197, 262 and 276, 
exhibited delays in growth kinetics when compared with 
5 the wild type virus (Table 1) • 

Third-site N-linked glycosvlation mutants 

To examine whether the observed effect on viral 
infectivity in mutants 88, 141, 160, 197, 262, and 276 
was due to amino acid substitutions introduced to replace 

10 the asparagine residue of the canonical N-linked 

glycosylation sequence with a non-canonical residue, six 
third-site N-linked glycosylation mutants were 
constructed (Table 2) . These six mutants, designated 90, 
143, 162, 199, 264 and 278, are called third-site mutants 

15 because they had the Ser/Thr residue of the Asn-X-Ser/Thr 
sequence replaced by a different amino acid residue. 

The ability of these six third-site mutants to 
infect CD4 -positive SupTl cells was also examined. If 
the phenotype of a third-site N-linked glycosylation 

20 mutant is similar to that of the wild type virus, it is 
likely that the observed defect in infectivity for the 
corresponding first-site mutant is the result of amino 
acid substitution at the first site rather than the loss 
of that particular N-linked glycosylation site. For 

25 instance, mutant 162 was indeed found to have similar 
growth kinetics to the wild type virus (Table 2) . This 
suggested that the impairment of viral infectivity 
observed for mutant 160 in SupTl cells was likely due to 
the substitution of asparagine residue with a glutamine 

30 residue at this particular consensus N-linked 

glycosylation site; but not due to the loss of this 
particular consensus N-linked glycosylation site. The 
remaining five third-site N-linked glycosylation mutants, 
like their respective first-site mutants , all showed 
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partial impairment in infectivity when compared with the 
wild type virus (Table 2) . 



Mutations Introduced at Combinations of N-Linked 
Glycosylation Sites 
5 Additional mutants in potential N-linked 

glycosylation sites in gpl20 were generated by 
oligodeoxynucleotide directed mutagenesis as described 
above. The 2.7 Kb Sall-BamHI fragment of the molecular 
provirus clone HXB2 , was cloned into bacteriophage 

10 M13mpl8 at Sall-BamHI sites and was used as the template 
for mutagenesis. The oligonucleotides used for the 
mutagenesis are listed in the Table 1. Changes were made 
from the consensus N- linked glycosylation sequence Asn-X- 
Ser/Thr (N-X-S/T) to either Gln-X-Ser/Thr (Q-X-S/T) or 

15 His-X-Ser/Thr (H-X-S/T) . Five mutants were generated 
each of which was altered at the amino acids contained 
within the parentheses as follows: C2 , (386/486); 
C3 (397/463); C4 (386/392/397/463); C5 

(386/392/397/406/463); and C6 (386/392/397/406/448/463) 

2 0 (Table 3) . The mutations were confirmed by Sanger - 

sequencing (Sanger et al., 1977, Proc. Natl. Acad, Sci. 
USA 74:5463) . 

Expression of envelope proteins and effect of 
combinations of mutations on viral infectivity 
25 Mutant proviral DNA and wild type DNA (3 fxg) was 

transfected into 3xl0 6 COS-7 cells (a monkey kidney cell 
line, CV-1, origin minus, SV40) using DEAE-dextran as 
described above. Cell lysates from COS-7 transfected 
cells collected 48 hours after transfection were examined 

3 0 by western blotting. Proteins were separated by SDS- 

polyacrylamide gel electrophoresis, transferred to 
nitrocellulose, and reacted with sheep anti-gpl20 
antisera (Chou et al., 1988, J. Infect. Dis. 157:805) 
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(Fig- 4) . Wild type DNA and all of the C-terminal 
mutants C2, C3 , C4, C5, and C6 expressed gpl60 and gp!20 
proteins at ratios similar to each other demonstrating 
that the position of the mutations has no apparent effect 
5 on cleavage of gpl60 to gpl20 and gp41. However, the 
mobilities of the mutated proteins is higher (faster) 
than those of the wild type (Fig. 6, Top) suggesting that 
some carbohydrates have been removed from these mutant 
proteins. In conclusion, oligosaccharides at the C- 

10 terminal region of gpl20 appear to be dispensable for 
cleavage of gpl60 to gpl20/gp41. 

To test the effect of the removal of carbohydrates 
from the C-terminal region of gpl20 on viral infectivity, 
cell free virus obtained from these mutants was used to 

15 infect the CD4-positive T cell line, SupTl. Supernatants 
were collected from COS-7 transfected cells 48 hours 
post-transf ection. RT assays were performed and were 
used as a measure of the amount of virus in the 
supernatant. An equal amount of virus, adjusted to an RT 

20 activity of approximately 40 OK cpm, was used to infect 
4xl0 6 SupTl cells. The infectivity of wild type and 
mutant viruses was determined by examining the cultures 
for the formation of syncytia and by measuring RT 
activity as described above. Syncytia were apparent in 

25 cultures infected with each of the mutants beginning at 
day 4 postinfection and the formation of syncytia 
progressed with similar kinetics in each culture (Fig. 
7 A) . Thus, the carbohydrates at C-terminal of gpl20, 
which encompass the CD4 -binding region, are not essential 

3 0 for viral infectivity. 

To determine whether other regions of gpl20 could 
be deglycosylated without affecting processing of gpl60 
and infectivity, another heavily glycosylated region 
located at the N-terminus of gpl20, from cysteine 126 to 

35 cysteine 196, was mutated. The oligonucleotides used for 
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mutagenesis are summarized in Table 1. Four N-terminal 
mutants, N2 (141/186), N3 (141/160/186), N4 
(136/141/160/186), and N5 (136/141/156/160/186) were 
generated (Table 3). Mutants N3 , N4 and N5 were 
5 defective in processing of gpl60. Cultures infected with 
mutant N2, which had two mutated N-linked glycosylation 
sites formed syncytium at day 4 post-infection and had a 
higher RT activity than that of the wild type (Figure 
7B) . In contrast, removal of more than three N-linked 

10 glycosylation sites (mutants N3, N4, and N5) in the N- 
terminal region of gpl20 significantly reduced viral 
infectivity, in that no syncytia could be observed at any 
time postinfection. 

The data described above demonstrate that six N- 

15 linked glycosylation sites at the C-terminal of gpi2 0 
spanning the CD4 -binding region are not essential for 
processing of gpl60 or for viral infectivity. Binding of 
gpl2 0 to CD4 is essential for infection of CD4-positive T 
cells* The data described above suggest that 

20 carbohydrates that cover the CD4 binding region are not 
important for the gpl20/CD4 interaction. However, 
carbohydrates at the N-terminal Cys 126-196 loop of gp 
120 are important for envelope processing and for viral 
infectivity. For vaccine production, the N-linked 

25 glycosylation sites in the cys 126-196 loop containing 
the VI and V2 sequences preferably are to be maintained 
to provide optimum proper conformation of the gpl20 
molecule. 

More Detail ed Analysis of the Effect of Combinations of 
30 Mutants on Viral Infectivity and Envelope Processing 
Using the methods described above, additional 
combinations of mutations were introduced into the C- 
terminal portion of the gpl20 of HIV-1 in the molecular 
clone HXB2, to study the effect of these mutations on 
35 viral infectivity. The results are presented in Table 4. 
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The amino acid numbers of the first amino acid in each 
consensus sequence are listed along the top of the table. 
Mutations in any given site are indicated by a "-" 
symbol, whereas wild type consensus N-linked 
5 glycosylation sites are indicated by a symbol. It is 

clear from the data in the table that some combinations 
of mutations result in loss of, or impaired infectivity, 
while others have no effect. 

For example, in row S seven N-linked glycosylation sites 

10 have been mutated without affecting viral infectivity and 
in row W a combination of eight mutations have been 
introduced that do not affect infectivity. In contrast, 
the particular combination of seven mutations (shown in 
rows Q and T) result in impaired infectivity and 

15 additional combinations of nine and ten (see row U and V, 
respectively) significantly reduce or eliminate viral 
infectivity. It is also evident from the data that the 
N-linked glycosylation site at amino acid number 289 
plays a role in infectivity when other N-linked 

20 glycosylation sites in the C-terminal portion of the 
molecule are also mutated. Thus, it is preferable for 
the mutant protein to have a wild type residue at 
position 289 if the molecule contains additional C- 
terminal mutations. 

25 Generation of Partially Dealycosylated opl20 for Use as a 
Candidate Vaccine 

Candidate vaccine gpl20 molecules should generally 
possess the following properties: 1) they should be 
partially deglycosylated in the C-terminal portion of the 

30 molecule (defined above) to a sufficient extent to permit 
immune recognition of this portion of the molecule; and 
2) a sufficient amount of the wild type conformation of 
the molecule should be retained such that the mutant 
virus substantially retains infectivity. A recombinant 

35 gpl20 molecule which satisfies both of these criteria is 
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likely to elicit a protective immune response to reduce 
viral infect ivity . 

Recombinant gpl20 molecules derived from any 
strain of HIV-l which satisfy the criteria listed above 
5 can be generated using the methods described above* All 
that is required is a knowledge of the sequence of the 
gpl60/gpl20 gene in the particular strain of HIV-l of 
interest, which if not already available, can be obtained 
by a skilled artisan using ordinary cloning and 

10 sequencing technology such as that described in the 
Molecular Cloning Manual (Sambrook et al. , 1989, 
Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, NY) . Potential N-linked glycosylation 
sites can be identified by locating the consensus Asp-X- 

15 Ser/Thr regions and mutations and combinations thereof, 
can be introduced into these sites as described above. 
Mutated molecules, wherein the mutations have 
substantially no effect on either infect ivity, can then 
be identified as described above. Such molecules can be 

2 0 obtained by the skilled artisan without undue 

experimentation because the techniques and tests to be 
used are common and familiar to those knowledgeable in 
the art. 

In a similar manner to that described above, gpl60 
25 molecules can be generated which are partially 

glycosylated in the C-terminal portion of gpl20. The 
methods for generating such molecules are identical to 
those described for gpl20. Partially deglycosylated 
gpl60 can also be used as a vaccine candidate provided 
30 the C-terminal end of the gpl20 portion is deglycosylated 
as described. 

To determine whether the molecule is sufficiently 
deglycosylated, its mobility on a gel be compared to wild 
type as described above. As indicated the mutation 
35 should produce a gpl20 entity of less than 90% of the 
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wild-type molecular weight. Alternatively, chemical 
techniques for quantitating sugar content are well known. 
See, e.g., Chapin et al. IRL Press (1986) pp. 178-181 and 
Methods of Carbohydrate Chemistry Vol. 7 (Whistler et al. 
5 Eds.) Academic Press (1976) p. 198 which describe acid 
hydrolysis and methanolysis . After methanolic 
hydrolysis, monosaccharides are derivatized e.g. , to 
trimethysilyl ethers of the methyl glycosides. 
Quantitation is accomplished by gas chromatography using 

10 parallel external standards of monosaccharide mixtures. 
Alternatively total sugar content of a glycoprotein of 
known amino acid sequence can be determined by mass 
spectroscopy to obtain accurate mass of glycosylated and 
unglycosylated moieties. 

15 Expression of Recombinant Partially Dealvcosylated gpl20 
Large quantities of recombinant partially 
deglycosylated gpl20 or gpl60 mutant glycoproteins can be 
obtained by expressing these proteins in a number of 
expression systems. For example, Chinese hamster ovary 

20 (CHO) cells can be trans fected with a plasmid encoding a 
mutated gpl20 or gpl60 gene, using any number of 
transf ection methods all of which are described in detail 
in Sambrook et al. (Supra) . Mutated proteins can be 
expressed in a constitutive manner under the control of 

25 its own promoter under the control of another promoter 
such as another retrovirus LTR. Alternatively, mutated 
proteins can be expressed in an inducible manner, wherein 
expression is driven by a promoter that responds to the 
addition of an inducer molecule to the transf ected cells. 

3 0 Examples of such promoters can be found in Sambrook et 
al. (Supra) . Glycoproteins that are so expressed can be 
recovered from the cells and from the cell medium using 
common biochemical techniques. See Lasky et al. Science 
233:209-212 (1986); Robey et al. Proc. Nat'l. Acad. Sci. 
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83.: 7023-7027 (1986); Pyle et al. Aids Research and Human 
Retrovirus 2:387-399 (1987). 

A baculovirus expression system can also be used 
to obtain large quantities of partially glycosylated 
5 gpl20 or gpl60. A gene encoding a mutated glycoprotein 
can be cloned into a commercially available baculovirus 
transfer plasmid. A recombinant baculovirus encoding 
such a protein can be generated as described by Summers 
and Smith (1988, A Manual of Methods for Baculovirus 

10 Vectors and Insect Cell Culture Procedures: Texas 
Agricultural Experiment Station Bulletin No. 1555, 
College Station, Texas) . The virus can be used to infect 
insect cells, such as Sf9 cells, whereupon the mutated 
glycoprotein will be expressed to high levels as the 

15 baculovirus replicates. Protein is recovered from the 
culture using ordinary standard biochemical techniques. 

The mutated proteins can also be produced as part 
of a viral particle, with or without alterations to other 
portions of the virus. See, e.g. the method of Aldovini 

20 et al. J. Virol. 64:1920-1926 (1990). 
Generation of Antibodies 

Recombinant envelope proteins can be used to 
generate antibodies using standard techniques, well known 
to those in the field. For example, the proteins are 

25 administered to challenge a mammal such as a goat, rabbit 
or mouse. The resulting antibodies can be collected as 
polyclonal sera, or antibody-producing cells from the 
challenged animal can be immortalized (e.g. by fusion 
with an immortalizing fusion partner) to produce 

30 monoclonal antibodies. Monoclonal antibody-producing 
hybridomas (or polyclonal sera) can be screened for 
antibody binding to the protein and to wild type 
envelope. They can also be screened for the ability to 
neutralize infectivity of HIV-1 isolates, preferably 

3 5 multiple (e.g., at least 3) isolates each having diverse 
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sequences in the hypervariable V3 region. By antibodies 
we include constructions using the binding (variable) 
region of such antibodies, and other antibody 
modifications . 

5 Vaccines 

The mutant envelope protein may be formulated into 
vaccines according to standard procedures known to those 
in the field. For example, procedures currently used to 
make wild-type envelope protein vaccines (e.g., 
10 Microgenysys gpl60 vaccine) can be used to make vaccines 
with the selectively deglycosylated envelope protein. 
Various modifications such as adjuvants and other viral 
or toxin components known for such vaccines or 
immunotherapeutics may be incorporated with the mutants. 
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Table 1. N-linked Glycosylation Mutants of HXB2 Envelope Glycoprotein 



MUTANT AMINO ACID 
5 INFECTIVITY 

VIRUS CHANGE 



MUTAGENIC 



OLIGONUCLEOTIDE ( 5 ' to 3 ' ) 



VIRAL 



10 



15 



20 



25 



30 



oo 


Asn 


to 


uin 


1 A\j X A ± 1 CiGTACAGGTGACAGAAAATTT 


** 


.L J O 


Asn 


U O 




luAl J. I bAAb L Aw A 1 AL i. AA 1 AC 


+ ■ 


141 


Asn 


uO 


fii in 


A X Ai^ X AA X AmLAAAly 1 Av» X AVjV—VyVxVjA 


** 

+ 


ISO 


Asn 


to 


uin 


uAlAAnLAuluClUl X ILAAlnl 


+ 


loU 




to 


Gin 


CTGCTCTTTCCAGATCAGCACAAG 


+ 


loo 


Asn 


to 


Gin 


TACCAATAGATCAGGATACTACCAGC 


+ 


197 


Asn 


to 


Gin 


TGACAAGTTGTCAGACCTCAGTCAT 


•+ 


230 


Asn 


to 


His 


TaAAATGTAATCATAAGACGTTCA 




234 


Asn 


to 


His 


ATAAGACGTTCCATGGAACAGGACCA 






Asn 


to 


Gin 


GACCATGTACACAGGTCAGCACAGTAC 




262 


Asn 


to 


Gin 


ACTGCTGTTACAAGGCAGTCTAG 


** 

+ 


276 


Asn 


to 


Gin 


TTAGATCTGTCCAGTTCACGGACAAT 


** 

+ 


289 


Asn 


to 


Gin 


TAGTACAGCTGCAGACATCTGTAGAAA 


+ 


295 


Asn 


to 


Gin 


CTGTAGAAATTCAATGTACAAGAC 


+ 


301 


Asn 


to 


His 


ACAAGACCCAACCACAATACAAGAAA 


+ 


332 


Asn 


to 


His 


GCACATTGTCACATTAGTAGAGC 


+ 


339 


Asn 


to 


Gin 


GCaAAATGGCAGAACACTTTAAAAC 


+ 


356 


Asn 


to 


Gin 


ATTCGGaAATCAGAAAACAATAATCTTTA 


+ 


386 


Asn 


to 


Gin 


TTTCTACTGTCAGTCAACACAACTG 




392 


Asn 


to 


Gin 


ACAACTGTTTCAGAGTACTTGGTTTAATAG 


+ 


397 


Asn 


to 


Gin 


GTACTTGGTTTCAGAGTACTTGGAG 


+ 


406 


Asn 


to 


His 


CTGAAGGGTCACATAACACTGAAGGA 




448 


Asn 


to 


Gin 


GATGTTCATCACAGATTACAGGGCTG 


+ 


463 


Asn 


to 


His 


GGTAATAGCAACCATGAGTCCGAGAT 


+ 


* Underlined 


type indicates mutation sites, 





** Partial impairment 
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Table 2. Third-site N-linked Glycosylation Mutants of HXB2 Envelope 
Glycoprotein 



5 MUTANT AMINO ACID MUTAGENIC VIRAL 

INFECTIVITY 

VIRUS CHANGE OLIGONUCLEOTIDE ( 5 'to 3') 



10 



15 



90 

143 

162 

199 

264 

278 



Thr to Val 
Ser to Ala 
Ser to Ala 
Thr to Glu 
Ser to Ala 
Thr to Val 



GGTAAATGTG GTCG ACAACTTTTGACATGT 

AATACCAATAGTGCATGCGGGAGAATGG 

CTGCTCTTTCAATATTGCCACAAGCATAAG 

GTTGTAACACCGAAGTCATTACACAG 

CTGCTGTTAAATGGCGCTCTAGCAGAAGAAGAG 

CTGTCAATTTCGTCGTCGACAATGCTAAA 



+ 

* 

+ 

* 

+ 

* 

+ 



* Underlined type indicates mutation sites 
** Partial impairment 



BNsr>oorr>- <wo oit 7tos a t > 



WO 93/17705 



PCT/US93/01598 



- 25 - 

Table 3. Combination N-linked glycosylation sites mutants of HXB2 env 
glycoprotein 



5 Mutant amino acid change gpl60 viral 

cleavage infect ivii= 











C2 


386/463 


+ 


+ 


C3 


397/463 




+ 


C4 


386/397/406/463 


+ 


+ 


C5 


386/392/397/406/463 




+ 


C6 


386/392/397/406/448/463 


+ 


+ 


N2 


141/186 


+ 




N3 


141/160/186 


* 

+ 




N4 


141/156/160/186 






N5 


141/136/156/160/186 







2 0 +* : severe impairment 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Essex, Myron E. , et al. 

(ii) TITLE OP INVENTION: SELECTIVELY 

DEGLYCOSYLATED HUMAN 
IMMUNODEFICIENCY VIRUS 
TYPE 1 ENVELOPE VACCINES 



(iii) NUMBER OF SEQUENCES: 



30 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: Massachusetts 

(E) COUNTRY : U.S.A. 

(F) ZIP: 02110-2804 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5" Diskette, 1.44 Mb 

(B) COMPUTER: IBM PS/2 Model 50Z or. 55SX 

(C) OPERATING SYSTEM: - MS-DOS (Version 5.0) 

(D) SOFTWARE: WordPerfect (Version 5.1) 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/850,770 

(B) FILING DATE: 13 Mar 1992 

(C) CLASSIFICATION: 



(Vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Freeman, John W. 

(B) REGISTRATION NUMBER: 29,066 

(C) REFERENCE/DOCKET NUMBER: 00379/016001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 542-5070 

(B) TELEFAX: (617) 542-8906 

(C) TELEX: 200154 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
TAGTATTGGT ACAGGTGACA GAAAATTT 28 
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNES8 : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
TGATTTGAAG CAGGATACTA ATAC 24 
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATACTAATAC CCAAAGTAGT AGCGGGA 27 
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GATAAACAGT GCTCTTTCAA TAT 23 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CTGCTCTTTC CAGATCAGCA CAAG 24 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TACCAATAGA TCAGGATACT ACCAGC 26 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TGACAAGTTG TCAGACCTCA GTCAT 25 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

TAAAATGTAA TCATAAGACG TTCA 24 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
ATAAGACGTT CCATGGAACA GGACCA 26 
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GACCATGTAC ACAGGTCAGC ACAGTAC 27 
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ACTGCTGTTA CAAGGCAGTC TAG 23 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TTAGATCTGT CCAGTTCACG GACAAT 26 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TAGTACAGCT GCAGACATCT GTAGAAA 27 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CTGTAGAAAT TCAATGTACA AGAC 24 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ACAAGACCCA ACCACAATAC AAGAAA 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GCACATTGTC ACATTAGTAG AGC 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GCAAAATGGC AGAACACTTT AAAAC 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATTCGGAAAT CAGAAAACAA TAATCTTTA 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

TTTCTACTGT CAGTCAACAC AACTG 25 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ACAACTGTTT CAGAGTACTT GGTTTAATAG 30 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GTACTTGGTT TCAGAGTACT TGGAG 25 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CTGAAGGGTC ACATAACACT GAAGGA 26 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GATGTTCATC ACAGATTACA GGGCTG 26 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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<D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GGTAATAGCA ACCATGAGTC CGAGAT 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 

(B) TYPE: nucleic acid 

(C) 8TRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GGTAAATGTG GTCGACAACT TTTGACATGT 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

AATACCAATA GTGCATGCGG GAGAATGG 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CTGCTCTTTC AATATTGCCA CAAGCATAAG 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 28: 
(i) 8EQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTTGTAACAC CGAAGTCATT ACACAG 



26 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CTGCTGTTAA ATGGCGCTCT AGCAGAAGAA GAG 33 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 30: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



CTGTCAATTT CGTCGTCGAC AATGCTAAA 
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1. A composition comprising a mutant recombinant 
human immunodeficiency virus type 1 (HIV-l) envelope 
glycoprotein which is mutated in its primary amino acid 



mutation being positioned between the C terminus of gpl20 
and the Cys at the N-terminal side of the gpl20 cysteine 



said mutant glycoprotein being sufficiently — >^ 

deglycosylated such that the total molecular mass of the 
mutant gpl20 component is less than 90% of the 
15 corresponding fully glycosylated wild type gpl2 0 

component, said mutant glycoprotein being effective, when 
present as a component of a complete HIV virion, to 
enable viral infectivity. 



20 l, wherein said virus is human immunodeficiency virus 
type l, strain selected from the group consisting of MN, 
HXB2, or IIIB, LAI, NL$3, MFA, BRVA, SC, JH3 , ALAI, BALI, 
JRCSF, OYI, SF2, NY5CG, SF162, JFL, CDC4 , SF33, AN, ADA, 
WMJ2, RF, ELI, Z2Z6, NDK, JY1, MAL, U455, Z321. - 

25 3. The mutant glycoprotein composition of claim 

1, wherein said glycoprotein is gpl60. 

4. The mutant glycoprotein composition of claim 
1, wherein said glycoprotein is gpl20. 

5. The mutant glycoprotein composition of claim 1 
30 wher in said mutant glycoproteins is deglycosylated, in 




sequence , vi*h respect to a wild_type_ HIV- lenve lope 




The mutant glycoprotein composition of claim 
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total, such that the total molecular mass of the mutant 
gpl20 component is less than 75% of the corresponding 
fully glycosylated wild-type gpl20 component, 

/S. The mutant glycoprotein composition of claim 
5 5, wherein said primary amino acid sequence is mutated 
such that one or more consensus N-linked glycosylation 
sequence mutation is a substitution of Asn, Ser, or Thr 
with a different amino acid. 

7. The mutant glycoprotein composition of claim 
10 l, selected from the group consisting of: 

a) C4 b) C5 c) C6 d) Q e) 

R 

f) S g) T h) W 

8. The mutant glycoprotein composition of claim 1 
15 wherein there are deglycosylations at multiple N-linked 

glycosylation attachment sites in the region between the 
C terminus of gpl20 and the Cys on the N-terminal side of 
the cysteine loop containing hypervariable region 4 (V4) . 

9. The mutant glycoprotein composition of claim 1 
20 in which at least one of the N-linked glycosylation 

sequences corresponding to positions 289 and 356 are not 
mutated. 

10. The mutant glycoprotein of claim 7 in which 
at least one of the N-linked glycosylation sequences 

25 corresponding to the following positions is 
deglycosylated: 386, 392, 397, 406 and 463. 

11. A vaccine for use in protection of a human 
against infection with HIV-1, said vaccine comprising the 
mutant glycoprotein composition of one of claim l. 
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12. A vaccine for use in treatment of a human 
infected with HIV-1, said vaccine comprising the mutant 
glycoprotein composition of one of claim 1. 

13. Antibodies to the mutant envelope protein of 
5 claim 1 produced by challenging a mammal with said 

envelope protein. 



14. The antibodies of claim 15 wherein said 
antibodies are monoclonal antibodies. 
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